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Abstract 

The Bernoulli sieve is a version of the classical 'balls-in-boxes' occupancy scheme, in 
which random frequencies of infinitely many boxes are produced by a multiplicative 
renewal process, also known as the residual allocation model or stick-breaking. We 
focus on the number Kn of boxes occupied by at least one of n balls, as n — )• oo. A 
variety of limiting distributions for Kn is derived from the properties of associated 
perturbed random walks. Refining the approach based on the standard renewal 
theory we remove a moment constraint to cover the cases left open in previous 
studies. 



1 Introduction and main result 

In a classical occupancy scheme balls are thrown independently in an infinite series of 
boxes with probability pk of hitting box k = 1,2, . . . , where {pk)keN is a fixed collection of 
positive frequencies summing up to unity. A quantity of traditional interest is the number 
Kn of boxes occupied by at least one of n balls. In concrete applications 'boxes' correspond 
to distinguishable species or types, and Kn is the number of distinct species represented in 
a random sample of size n. Starting from Karlin's fundamental paper [19], the behaviour 
of Kn was studied by many authors [3l 13 ITTl |20]. In particular, it is known that the 
limiting distribution of Kn is normal if the variance of Kn goes to infinity with n, which 
holds when p^s have a power-like decay, but does not hold when p^s decay exponentially 
as — )■ oo [7]. See [5l HD] for survey of recent results on the infinite occupancy. 

Less explored are the mixture models in which frequencies are themselves random 
variables {Pk)kmj while the balls are allocated independently conditionally given the fre- 
quencies. The model is important in many contexts related to sampling from random 
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discrete distributions, and may be interpreted as the occupancy scheme in random envi- 
ronment. The variabihty of the allocation of balls is then affected by both the randomness 
in sampling and the randomness of the environment. With respect to Kn-, the environ- 
ment may be called strong if the randomness in (P^,) has dominating effect. One way to 
capture this idea is to consider the conditional expectation 

oo 

i?: := I {Pk)) = 5^ E(i - (1 - p,r) 

k=l 

and to compare fluctuations of Kn about -R* with fluctuations of i?*. By Karlin's [19] 
law of large numbers, we always have Kn ~ -R* a.s. (as n — )■ oo) so the environment may 
be regarded as strong if the sampling variability is negligible to the extent that i?* and 
Kni normalized by the same constants, have the same limiting distributions, see ^13j for 
examples. 

In this paper we focus on the limiting distributions of Kn for the Bernoulli sieve 
[HI m [12] , which is the infinite occupancy scheme with random frequencies 

Pk:=W^W2---Wi,_^{l-Wu), ken, (1) 

where {Wk)km are independent copies of a random variable W taking values in (0,1). 
From a viewpoint, Kn is the number of blocks of regenerative composition structure 
[H [13] induced by a compound Poisson process with jumps | log Wk\- Discrete probability 
distributions with random masses ([1]) are sometimes called residual allocation models, 

the best known being the instance associated with Ewens' sampling formula when W = 
beta(c, 1) for c > 0. Following P [12], frequencies ([T]) can be considered as sizes of the 
component intervals obtained by splitting [0, 1] at points of the multiplicative renewal 
process {Qk '■ k G No), where 

j 

Qo:=l, :=n^- ^eN. 

i=l 

Accordingly, boxes can be identified with open intervals {Qk, Qk-i), and balls with points 
of an independent sample f/i,...,f/„ from the uniform distribution on [0,1] which is 
independent of {Qk)- In this representation balls i and j occupy the same box iff points 
Ui and Uj belong to the same component interval. 

Throughout we assume that the distribution of |logiy| is non- lattice, and use the 
following notation for the moments 

fi:=E\logW\, := Var(logiy) and i/ := E| log(l - VT) |, 

which may be finite or infinite. 

Under the assumptions u < oo and cr^ < cxd, the CLT for Kn was proved in [9J by 
using the analysis of random recursions. Assuming only that z/ < cxd in [12] a criterion of 
weak convergence for Kn was derived from that for 

p*{x) := inf{A; G N : H^i . . . VT^ < e"^}, x > 0. (2) 
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In this paper we treat the cases of finite and infinite u in a unified way, and obtain the 
hmiting distribution of Kn directly from the properties of the counting process 

N*{x) := #{A; G N : Pfe > e-^'} (3) 
= #{keN:Wi---Wk-i{l-Wk)>e-^}, x > 0, (4) 

in the range of small frequencies (large x). Although this approach is familiar from 
[51 [12], the application to the Bernoulli sieve is new. We emphasize here that the connec- 
tion between Kn and N*{x) remains veiled unless we consider the Bernoulli sieve as the 
occupancy scheme with random frequencies (a random environment), and the process of 
occupancy counts Kn is analyzed conditionally on the environment. Thus we believe that 
the present paper offers a natural way to study the occupancy problem, since the method 
is based on a direct analysis of frequencies and calls for generalizations. Our main result 
is the following theorem. 

Theorem 1.1. // there exist functions f : R+ — )■ ]R_|_ and g : R+ — )■ M such that {p*{x) — 
g{x))/ f{x) converge weakly (as x ^ oo) to some non- degenerate and proper distribution, 
then also {Xn — &n)/on converge weakly (as n oo) to the same distribution, where Xn 
can be either Kn or N*{\ogn), and the constants are given by 

/■log n 

bn= / gi\ogn-y)¥{\\og{l-W)\edy}, a„ = /(logn). 
Jo 

In more detail, we have the following characterization of possible limits. 

Corollary 1.2. The assumption of Theorem 11.11 holds iff either the distribution of 
I logH^I belongs to the domain of attraction of a stable law, or the function P{| logiy| > x} 
slowly varies at oo. Accordingly, there are five possible types of convergence: 

(a) //o"^ < oo then, with 

bn = fi-^ ilogn - J F{\log{l-W)\> x}dxj (5) 

and an = (/i~^(T^ log n)-*^/^, the limiting distribution of {Kn — bn)/an is standard 
normal. 

(b) //cr^ = oo, and 

/ y'^F{\\ogW\ e dy} ~ L(x) x ^ oo, 
Jo 

for some L slowly varying at oo, then, with bn given in ([5]) and an = /i~^^^C[iogn]; 
where (c^) is any positive sequence satisfying lim nL{cn)/c^ = 1, the limiting dis- 

tribution of {Kn — bn)/an is standard normal. 

(c) // 

F{\logW\>x} ~ x-°L{x), X ^ oo, (6) 
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for some L slowly varying at oo and a G (1, 2) then, with bn given in and an = 
^-(Q!+i)/a^ i^Q^'^ Q^^y pQsitive seQuence satisfying lim nL(c„)/c" = 1, 

the limiting distribution of {Kn — bn)/cin is a-stable with characteristic function 
t ^ exp{-|t|'T(l -a)(cos(7ra/2) +ism(7ra/2)sgn(t))}, t e M. 

(d) Assume that the relation ([6]) holds with a = 1. Let r : M — t- M 6e any nondecreasing 
function such that limxP{| logiy| > r(x)} = 1 and set 



m[xi 



log W^l > y}dy, x > 0. 



Then, with 



log n 



log n — y 



P 



and 



m{r{i\ogn — y) /miXogn — y))) 
r{\ogn/m{\ogn)) 



log(l - W) 



G dy 



" milogn) ' 

the limiting distribution of {Kn — bn)/a,n is 1-stable with characteristic function 

t ^ exp{-|t|(7r/2 - zlog |t| sgn(t))}, t G M. 

(e) // the relation ([6]) holds for a G [0, 1) then, with 6„ = and an '■= log" n/L(logn), 
the limiting distribution of Kn/o,n is the Mittag-Leffler law 9a with moments 



x^ 9aidx) 



Tf^il - a)r{l + ak)' 



keN. 



Define /„ to be the index of the last occupied box, which is the value of k satisfying 
Qk < min(f/i, . . . , f/„) < Qk-i, and let L„ := /„ — Kn be the number of empty boxes 
with indices not exceeding /„. From [12] we know that the number L„ of empty boxes 
is regulated by /i and u via the relation lim„_>oolEL„ = z///x (provided at least one of 
these is finite), and that the weak asjTiiptotics of coincides with that of p*(logr;,), i.e. 
(/„ — bn)/an and (p*(logn) — bn)/an have the same proper and non-degenerate limiting 
distribution (if any). In [9] [12] it was shown that under the condition u < oo the weak 
asymptotics of Kn coincides with that of In, hence with that of p*(logn). That is to 
say, when u < oo the way L„ varies does not affect the asymptotics of Kn through the 
representation Kn = In — Ln- Clearly, this result is a particular case of Theorem 11.11 
because when z/ < oo 



lim 

x—>oo 



g{x) - /; g{x - y) P{| log(l -W)\e dy} 
fix) 







(7) 



(see Remark 13.11 for the proof). 

Theorem 11.11 says that in the case 1^ = 00 the asymptotics of Ln may affect the 
asymptotics of Kn, and this is indeed the case whenever ([7]) fails, hence a two-term 
centering of Kn is indispensable. The following example illustrates the phenomenon. 
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Example 1.3. Assume that, for some 7 G (0, 1/2), 



F{W>x} = - --4^ ^, XG[0,1). 

1+ log(l -x)^ ^ ^ 



Then 



Elog^iy<oo and P{| log(l - > s} ~ a;"^ as a; ^ 00, 
and in this case, 

a„ = const log"'^''^ n and 6„ = /i^"'^(logn — (1 — 7)^Mog^^''' n + o(log^~''' n)). 

Thus we see that the second term 6„ — logn of centering cannot be ignored. Moreover, 
one can check that 

1 ^ EM/^^ 1 

EL^^-V"— — ~ 6„-/i~Mogn ~ — -log^^'^n, 

/c /i(l-7) 

which demonstrates the substantial contribution of L„. 

We shall make use of the poissonized version of the occupancy model, in which balls 
are thrown in boxes at epochs of a unit rate Poisson process. The variables associated with 
time t will be denoted K{t),R*{t), etc. For instance, the expected number of occupied 
boxes within time interval [0,t] conditionally given (Pk) is 

00 00 
R*it) = Y.{e-Hyn\)R: = J]E(1 - 6"*^'=). 

n=0 k=l 

The advantage of the poissonized model is that given (P^) occupation of boxes 1,2,..., 
as t varies, occurs by independent Poisson processes of intensities Pi, P2, • • • • 

The variable N*{x) is the number of sites on [0,x] visited by a perturbed random 
walk with generic components | log 1^1, | log(l — W)\. We shall develop therefore some 
general renewal theory for perturbed random walks, which we believe might be of some 
independent interest. The approach based on perturbed random walks is more general 
than the one exploited in [T2] and is well adapted to treat the cases u < 00 and u = 00 
in a unified way. 



2 Renewal theory for perturbed random walks 
2.1 Preliminaries 

Let {C,k, Tik)km be independent copies of a random vector (^, rj) with arbitrarily dependent 
components ^ > and r] > 0. We assume that the law of ^ is nonlattice, although 
extension to the lattice case is possible. For {SkjkeNo ^ random walk with = and 
increments ^k, the seqeunce (Tk)k£N with 

Tk := Sk-i + Ilk, keN, 
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is called a perturbed random walk (see, for instance, [2], [IH Chapter 6], [TS]). Since 
lim Tfc = oo a.s., there is some finite number 

k—^oo 

N{x) := e N : Tfc < x}, x > 0, 
of sites visited on the interval [0, x]. Let 



R(x) := - exp (-xe-^'=) V x > 0. 



Our aim is to find conditions for the weak convergence of, properly normalized and cen- 
tered, N{x) and R{x), as x — )■ oo. 

It is natural to compare A^*(x) with the number of renewals 

p(x) := #{A; ENq: Sk<x} = mi{k eN: Sk> x}, x > 0. 

In the case E?7 < oo weak convergence of one of the variables (p(x) — g{x))/ f{x) and 
(A^(x) — g{x))/ f{x) (with suitable /, g) implies weak convergence of the other to the same 
distribution. Our main focus is thus on the cases when the contribution of the rj^s does 
affect the asymptotics of N{x). To our knowledge, in the literature questions about the 
asymptotics of perturbed random walks were circumvented by imposing an appropriate 
moment condition which allowed reduction to (Sk) (see, for instance, [HI Chapter 6], [16] . 
[211 Theorems 2.1 and 2.2]). 

Recall the following easy observation: for x,y > 

p(x + y)- p(x) < p'(x, y) = p{y), (9) 

where p'{x,y) := mf{k — N{x) E N : Sk — Sp(^x) > v}- Furthermore, {p'{x,y) : y > 0) is 
independent of p(x) and has the same distribution as {p{y) '■ y >0). 

Denote U{x) = Ep(x) = YlT=o^{^k < x} the renewal function of (Sk). From ([9]) and 
Fekete's lemma we have 

U{x + y)-U{y)<Ciy + C2, x,y>0, (10) 

for some positive constants Ci and C2. 

The next lemma will be used in the proof of Theorem 12.51 

Lemma 2.1. // weakly converges then 



y 9{x)-g{x-y) n 1 11 f 1 ■ n^\ 
hm Jl^ ~ locally umjormly in y, [11) 



and, for every A E 



j.^ lo ajx - y) dG{y) - f^^^^ g{x + \- y) dG{y) ^ ^ 
™ fix) 



x^oo 



for arbitrary distribution function G with G{0) = 0. 
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Proof. For fixed / call gi,g2 /-equivalent if lim gi(^) g^(^) = q. Clearly fllip is a property 

of the class of /-equivalent functions g. 

We refer to the list of possible limiting laws and corresponding normalizations for p{x) 
[121 Proposition A.l]. Relation f lTT]) trivially holds when g{x) = 0. It is known that g{x) 
cannot be chosen as zero if the law of ^ belongs to the domain of attraction of the a-stable 
law for a G [1, 2]. It is known that for ^ in the domain of attraction of a stable law with 
a G (1, 2] one can take g{x) = x/E^ which satisfies (ITT]) . 

Thus the only troublesome case is the stable domain of attraction for a = 1. According 
to [H Theorem 3], one can take 

9{x) 



m{r{x/m{x))) ' 

where m{x) := > y}dy, and r{x) is any nondecreasing function such that lim xF{^ > 

x^oo 

r{x)} = 1. The concavity of m(x) implies that x x/m{x) is nondecreasing. Thus 
X I—)- m{r{x/m{x))) is nondecreasing too as superposition of three nondecreasing func- 
tions. Hence, for every 7 G (0, 1), 

g{ix) > -fg{x), X > 0, 

which readily implies subadditivity via 

g{x) + g{z) > ( — ^ + ]g{x + z) = g{x + z). 

\X + Z X + z J 



Thus, 



qix) — qix — y) 
lim sup ' — < 0. 



For the converse inequality for the lower limit it is enough to choose non-increasing g from 
the /-equivalence class, and by [H, Theorem 2] this indeed can be done by taking inverse 
function to x 1— i- xm{r{x)). 

The stated uniformity of convergence is checked along the same lines, and (fT2|) follows 
from the subadditivity of b and easy estimates. □ 



2.2 The case without centering 

We start with criteria for the weak convergence of p{x) and R{x) in the case when no 
centering is needed. 

Theorem 2.2. ForY{x) any of the variables p{x), N{x) or R{e^) the following conditions 
are equivalent: 

(a) there exists function /(x) : M+ — > M+ such that, as x ^ 00, F(x)//(x) weakly 
converges to a proper and non- degenerate law, 

(b) for some a G [0, 1) and some function L slowly varying at 00 

>x} ~x~"L(x), x^oo. (13) 
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Furthermore, if fll3p holds then the limiting law is the Mittag-Lejfler distribution 6a, and 
one can take f{x) = x°'/L{x). 

The assertion of Theorem 12.21 regarding p{x) follows from fl2[ Appendix]. For the 
other two variables the result is a consequence of the following lemma. 

Lemma 2.3. We have 

N(x) 

lim — -— = 1 in probability 

x-yco p[xj 

and 

R(x) 

lim — = 1 in probability. 

p(logx) 



a;— >oo 



Proof. By definition of the perturbed random walk 

pix) 

for < ?/ < X. Clearly, p(x) t oo a.s. and 

p{x -y)> p{x) - p\x -y,y) a.s. (15) 
with p' as in ([9]), from which 

p{x-y) p 

— — > 1, x — )■ oo. (16) 

p(a;) 

Finally, by the strong law of large numbers we have 

lim — = Pjr] > y} a.s. 

x^oo p(x) 

Therefore, dividing f|T4|) by p(x) and letting first a; — oo and then y — )■ oo we obtain the 
first part of the lemma. 

As for the second part, we use the representation 

/oo 
(1 -e-^/^)dAr(logi/) 

= f N{\ogx-\ogy)e'ydy-{l-e~'')N{Q). (17) 
Jo 

Since N{x) is a.s. non- decreasing in x we have, for any a < x, 

fX pa 

/ A^(logx — log?/)e"^d?/ > / A^(logx — log?/)e~^dj/ > A^(logx — loga)(l — e""). 
Jo Jo 

Dividing this inequality by p(logx), sending x — )■ oo along with using (fT6|) and the already 
established part of the lemma, and finally letting a — )■ oo, we obtain the half of desired 
conclusion. 
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To get the other half, write 

/ N{\ogx-\ogy)e-ydy < p(logx)(l - e"^) (18) 
Jo 

+ / (p(loga;-log?/) -p(logx))e^^d?/, 

where ([9]), the inequahty N{x) < p{x) a.s., and the fact that p{y) is a.s. non- decreasing 
in y have been used. Since, by ( ITOl) . 



E / (p(logx - logy) - p(loga;))e ^dy < / {Ci\\ogy\ + €2)6 ^dy < 00, 
Jo Jo 

then dividing f|T8l) by p(logx) and sending x — ?■ 00 completes the proof. □ 
2.3 The case with nonzero centering 

Now we turn to a more intricate case when some centering is needed. We denote F{x) 
the distribution function of rj and U{x) the renewal function of (Sk). 

We will see that a major part of the variability of N{x) is absorbed by the renewal 
shot-noise process (M(x) : x > 0), where 

p{x)~l 

M{x) := J2 Fix-Sk), x>0. 

k=0 

Lemma 2.4. We have 

e(n{x)-M{x)] =[ F{x-y){l-F{x-y))dU{y), 







which implies that, as x ^ 00, 

2 



e(^N{x) - M{x)^ = o(^j^{l-F{y))dy^=o{x). 



(19) 



Proof. For integer i < j, 



E(^l{5,:<x}(l{5.+r,,+i<x} - F{X - S,))l{s,<x}{l{S,+v,+i<x} " F{x - Sj)) (^fc, ??fe)Li^ 

l{5,<x}(l{Si+,,i+i<x} - F{x - Si))lis,<x} (^F{x - Sj) - F{x - Sj)^ = 0. 
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Hence, 

2 



OO y \ 2 

= E ^ l{5fc<x} f l{5fe+r?fe+i<x} - - Sk) j 
= Ef2l{s,<.}(F{x-Sk)-F'{x-Sk)] 

k=0 ^ ^ 

= [ F{x~y){l-F{x-y))dU{y). 
Jo 

If E?7 < oo, then by the key renewal theorem, as x — )■ oo, 

\imE( N{x) - M{x)] = / F{y){l - F{y))dy < oo, 

x^oo \ J Jo 

where a := E^ may be finite or infinite. If Kt] = oo and a < oo, a generahzation of the 
key renewal theorem due to Sgibnev ^22j, Theorem 4] yields 

e(^N{x) - M{x)^ ^ a-^ {1- F{y))dy. 

Finally, if Krj = oo and a = oo a modification of Sgibnev's proof yields 

e(^N*{x)-M{x)^ = o(^j\l~F{y))dyy 

Thus (fT9|) follows in any case. □ 
Theorem 2.5. If for some random variable Z 

p{x) -g{x) 



fix) 

then also 



A Z, x^oo, (20) 



M{x)-j;g{x-y)dF{y) , 

W) ^ ' ' ^ ^ 

N(x) - g(x-y)dF(y) a , . 

V ; Jo yv y± vy; ^ x ^ oo, (22) 



and 

R{x)-j;;''g{\ogx~y)dF{y) 
/(logx) 



4 Z, X ^ oo. (23) 
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Proof. Integrating by parts yields 

M{x)= [ F{x-y)dp{y) = -F{x)+ I p{x-y)dF{y), 



so to prove (^T]) it is enough to show that, as x — > oo, 

Jo J[^) 

For any fixed 6 G (0,x) we may decompose T{x) as 

From the proof of Lemma 12.11 we know that without loss of generality it can be assumed 
that g{x) is nondecreasing. Thus, almost surely, 

p{x)~g{x) p{x)-p{x-5) 

-^(^^^'^ m — ^^'^ 

< Ti(x) 

p{x) - g{x) g{x) - g{x- 5) 

2 + — 7w — 

In view of ([9]) and ( ITTj) . we have the convergence lim limTi(x) = Z in distribution. 

(5— >oo X— j-oo 

For X > set 
and 

Z, := {Z.,{t) : t > 0). 



We will establish next that 

sup {p{y) - g{y)) 



^ — — — = sup ^^^(t) A sup 2'(t), X — )■ oo, (24) 

te[o,i] te[o,i] 



and, similarly. 



inf {p{y) -g{y)) 

, = inf ZJt) A inf Z(t), x^oo. (25) 

/(x) te[o,i] te[o,i] 

Case 1: g{x) = x/KC,. Then Z is an a-stable random variable for some a G (1,2] 
fl2\ Proposition A.l]. Denote hj Z = {Z{t) : t > 0) a stable Levy process such that 

Z{1) = Z. Regard and Z as random elements of Skorohod's space D[0, oo) endowed 
with the Mi-topology. 
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By [HI Theorem lb], 

Z, X ^ oo. (26) 

Since the supremum functional is Mi-continuous, we obtain f l2^ and fl25l) using the con- 
tinuous mapping theorem. 

Case 2: g[x) ^ x/E^. Then Z is a 1-stable random variable. Set Z = {Z{t) : t > 0), 
where 

Z{t) = Zi{t) -tlogt, t>0, 

and {Zi{t) : t > 0) is a stable Levy process such that Zi{l) = Z. With this notation we 
derive fl26|) from |T5l Theorem 2], from which fl24|) . f l25|) follow along the above lines. 
Now it remains to estimate 

inf (p{y) - g{y)) inf (p{y) - g{y)) 
< ^^^^ 

sup (p(i/) - ^(y)) 

sup {p{y) - ^(y)) 
< ^^^^^^^=^7^;;^ (fW-fW). 

Using fl24l) and fl25l) , we conclude that lim lim T2 {x) = in probability. The proof of fl21l) 

S—^oox-^oc 

is complete. 

In view of (IT^ . E(M(x) — A^(x))^ = o(x). Since a^(x) grows not slower than x (see 
[T2| Proposition A.l]), Chebyshev's inequality yields 

iV(a;)-M(x) p ^ 

Now ([22]) follows from (12T|). 

It remains to establish (ESI)- To this end, introduce for x > 1 



Q^{x) := ^ e-^(A^(logx) - A^(logx - \ogy))dy > 0, 
Q2(a;) := [ e~y{N(logx - logy) - N(\ogx))dy > 0. 

Using 

PX pX 

EN{x)= / F{x-y)dU{y) = -F{x)+ / U{x-y)dF{y) 
Jo Jo 

and ( ITU]) , we conclude that for y G (l,x), 

EA^(logx) - EiV(logx - logy) < ^1(1 + F(0)) logy + 6*2(1 + F(0)). 
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Therefore, EQi(x) = 0(1), as x ^ oo, whence 4 0. Similarly, ^ 0. 



Thence, recalling ([Tj 

Qi{x)-Q2{x) {l-e-^)N{logx)-R{x)-{l-e-^)N{0) p ^ 

m ^ W) " °' 

As A^(logx) grows in probability not faster than logarithm, we conclude that 

N{logx)-R{x) P ^ 
/(logx) 

Now an appeal to ( l22i) completes the proof. □ 



3 Proof of Theorem 11.1 

Set 

S'*:=0 and S*^ := \logWi\ + . . . + \logWk\, keN, 

and 

T*:=Sl_, + \logil-Wk)l keN. 
The sequence {T^)ke'N is & perturbed random walk. Since 

p*{x) = mf{k eN:Sl>x}, N*{hgx) := #{A; G N : < logx}, 

an appeal to Theorem 12.21 (case g = 0) and to Theorem 12.51 (case g ^ 0) proves the result 
for A^*(logn). To prove the statement for Kn we shall use the poissonization. 



Step 1. We first check that 



limEVar(ir(t)|(P,)) = ^, (27) 

t^oo p 



which is for /i = oo. Plainly, this will imply that 

K{t)-E{K{t)\iPk)) 

for any function q{t) such that limg(t) = oo. 
According to formula (25)], 

oo 

Var(ir(t)|(Pfc)) = J2 i^''"^' - e"'*''^) • 

k=l 

With U*{x) := T.T=o^{S*k < and ip{t) := Ee-'^^-^) we obtain 

oo /■ 

EVar(/s:(t)|(Pfc)) = E^ ((^(te-^^*~i) -^(2te~^fe-i) 

k=l ^ 



A 0, (28) 







^{te-^) - ip{2te-'') ]dU*{x) 
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which is the same as 

EVar(K(e^)|(Wfe)) = / A{x - y)dU*{x). (29) 



for A(t) := <y9(e*) — ip{2e^), t G M. To proceed, observe that 



oo „_2(1-VK) _ p-2z{l-W) 



dz = log 2, 

z 

which imphes that Aii) is integrable, since by Fubini's theorem, 

oo g_z(l_Vl/) _ ^-2^(1-11^) 

E / d2 = loK2. 



Furthermore, arguing in the same way as in [121 Section 5] we can prove that A{t) is 
directly Riemann integrable. Therefore, application of the key renewal theorem on M to 
([29D yields 

Chebyshev's inequality together with fl271) imply that, for every £ > 0, 
limP{|A^(t) - ^{K{t)\{Pk))\ > £q{t)\{Pk)} = in probability, 

t—^oo 

which proves (1251) upon taking expectation and invoking the Lebesgue bounded conver- 
gence theorem. 

Step 2. Step 1 implies that weakly converges to a proper and nondegenerate 

probability law if and only if = ^ weakly converges to the same law. 

Using this observation. Theorem 12.21 (in case g = 0) and formula ( l23l) of Theorem 12.51 
(in case g 0) we conclude that weak convergence of to some distribution 6 

implies the weak convergence of both 

R*{t) - Cgj^ogt - y) P{| log(l - W)\ e dy} 
/(logt) 

and 

K{t) - /o°^'^(logt - y) P{| log(l - W)\ g d7/} 
/(logt) 

to e. 

Step 3. It remains to pass from the poissonized occupancy model to the fixed-n model. 
Since /(logt) is slowly varying, and in view of (fT2|) . 

h{t) := r'^(logt-i/)P{|log(l-l^)| edy} 
Jo 

satisfies 

Un.MWl±£)])^„ 

t^oo f (log t) 
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for every e > 0. We thus have 

Kit)-bi[til±e)\) 



x±{t) 



/(log(Lt(l±5)J)) 



Let Ct be the event that the number of balls thrown before time t lies in the limits from 
[(1 — e)t\ to [(1 + By monotonicity of K^, we have 



X_(t) > X4t)lz, 

^ i^L(i-^)tj-&(Lt(l-g)J) ^ 



Since f{Ct) — )■ 1, we conclude that 



, Kn — bin) 
0[x, oo) > lim sup P< — — — > X 



f{\ogn) 

for all X > 0. To prove the converse inequality for liminf, one has to note that 

4 0, 

and proceed in the same manner. The proof is complete. 

Remark 3.1. Here is the promised verification of (^^. Below we use terminology introduced 
in the proof of Lemma 12. 1[ 



Lemma 3.2. Relation is a property of the class of f- equivalent functions g 



Proof. Assume that g satisfies ([7]). We have to show that any gi such that lim ^^^^^ 



satisfies ([7]), as well. 

Plainly, it is enough to check that 

A{x) := — -rr-. — )■ 0, X — )■ cxD. (30) 

For any e > there exists xq > such that for all x > xq ^^^^Y{x)^^^^ ^ ^' ^^^^^ / 
regularly varying with index (3 G [1/2, 1], without loss of generality we can assume that 
/ is nondecreasing. Hence 

Jo f{x - y) 

+ r \3{x~y)-gi{x-y)\ ^^ 

ye[0,xo] J[y) 

Sending x — )■ oo and then e 10 proves (|5Ui) . □ 
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If the law of | log Vl^| belongs to the domain of attraction of an a-stable law, a G (1, 2] 
then {p{x) — g{xj)l f{x) weakly converges with g{x) = x/ n and appropriate /(x). Such 
a g trivially verifies ([?]) which, by Lemma I3.2[ entails that every gi from the same /- 
equivalence class verifies ([7]). 

If the law of | log 14^1 belongs to the domain of attraction of a 1-stable law, then 
{p{x) - g{x))/f{x) weakly converges for g{x) = and f{x) = with 

m and r as defined in part (d) of Corollary 11.21 Since r is regularly varying with index 
one, without loss of generality we can assume it and hence g are differentiable. Since 
is regularly varying with index (—1), it converges to 0, as a; — )■ oo. Besides that, 
lim xF{( > x} = in view of u < oo, where we denoted | log(l — W) \ by (. Hence, 

x—^oo 

g{x)¥{C > x} 

^™ 7r~\ = 

x^oo }[X) 



Thus it suffices to check that 



-^(x -C))l{c<x} 



lim ^"^^ %7 , ^n^^,^., ^ ^ ^3^) 

x^co j[x) 

Now subadditivity and differentiability of g can be exploited in order to show that 

\g{.x) - g{y)\ < K\x - y\, x,y>0, 

where K := l/m(r(l)). This immediately implies (13T|) and the whole claim by virtue of 
Lemma 13.21 
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